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ABSTRACT 



[Purpose] In a code-excited linear prediction decoding method, to suppress 

degradations in the coded speech quality due to transmission errors. 

[Constitution] A transmission code from a transmission path is passed through an 

error correcting circuit 30, speech coding information respectively of 
a past, current and future frame are stored in a buffer 31, and code 
error detection information of respectively a past, current and future 
frame are stored in a buffer 34. An error state flag is set up in an 
error state managing portion 35 based on the code error detection 
information in the buffer 34. Based on this error state flag, a linear 
prediction parameter, power parameter and a pitch period, sound 
source gain parameter stored in the buffer 31 are restored by 
interpolation in a parameter decoding portion 32. Furthermore, 
post-filter parameters are changed according to the error state flag in 
the speech decoding portion 33. 



CLAIMS 



1. A speech decoding method used for reconstructing speech during reception, by 

the frame, of an error correction code and speech coding information composed of a 
linear prediction parameter code, a power parameter code, a pitch period code, a noise 
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vector code of a noise codebook and a sound source gain parameter code; wherein code 

error detection information respectively for a past frame, a current frame and a future 

I 

frame is used when coding speech coding information with respect to the frame to be 
decoded, and the speech of the current frame is reconstructed and restored depending on 
an error detection state of each frame. 



DETAILED DESCRIPTION OF THE INVENTION 

Field of Industrial Application 

The present invention relates to a speech decoding method applied, for example, to a 
mobile radio system in which code errors frequently occur on the transmission path, for 
reconstructing speech with little quality degradation due to code errors from speech 
coding information and code error detection information processed with code error 
control technology (error correction/detection coding processing etc.). 



As high-capacity speech coding methods of 8 kbit/s or less for digital mobile radio or 
speech storage services, code-excited linear prediction coding (CELP) and vector 
sum-excited linear prediction coding (VSELP) are typical formats. These technologies 
are respectively described in M. R. Schroeder and B. S. Atal, "Code-Excited Linear 
Prediction (CELP): High-quality Speech at Very Low Bit Rates", Proc. ICASSP-85, 
25.1.1, pp. 937-940 (1985) and I. A. Gerson and M. A. Jasiuk, "Vector Sum Excited 
Linear Prediction (VSELP) Speech Coding 8 kbps", Proc. ICASSP-90, S9.3, pp. 
461-464 (1990). 



Here, CELP coding shall be explained. / In CELP coding, a linear prediction parameter 
code, a power parameter code indicating the power of the frame, a pitch period code, a 
noise code and a sound source gain parameter code are transmitted as a single frame of 
about 5-50 ms as speech coding information. A CELP decoder is shown in Fig. 1. At 
the CELP decoder, the decoded linear prediction parameter A is set as the coefficient of 
the linear prediction synthesis filter 12. Using the excitation vector (residual 
waveform) of the frame obtained immediately previous (in the past) to then and 
decoded pitch period code L, a past excitation vector is cut out from the adaptive 
codebook 10 at that period, and a time sequence vector repeating one frame of this is 
outputted. The code vector indicated by the inputted noise code C is read from the 
noise codebook 11 as a vector. These time sequence vectors are respectively weighted 
in accordance with the gain codes G (= gl, g2) inputted at the multipliers 21, 22, after 
which they are added in the adder 23, the addition output is supplied to the synthesis 
filter 12 as an excitation vector, and the power gain P is multiplied at the multiplier 24, 
thereby obtaining reconstructed speech. Furthermore, the post-filter 13 gives a final 
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speech output with reduced sense of noise due to formant emphasis and pitch emphasis. 

In a speech transmission system applied to a field of application in Wjhich code errors 
occur, loss of quality of the transmitted speech due to coding errors is suppressed using 
error correction coding technology. However, in regions of application such as mobile 
radio in which burst errors frequently occur, it is not possible to obtain an adequate 
application effect even if a strong error correction code is used. This is because it is 
difficult to eliminate all errors even if a strong error correction code is used due to the 
high density of errors in bursts. j 

If speech is decoded from coded information with uncorrectable errors remaining, then 
the decoded speech will be largely distorted. In order to hold down the distortion, an 
error detecting function of checking whether the code of a frame has been correctly 
decoded is provided, and when an error has been detected, a process different from the 
usual decoding method, i.e. waveform restoration, is performed to suppress the 
influence of code errors as much as possible. 

As restoration methods used until now, there are the following two types. The first, for 
example, as described in M. J. McLaughlin, "Channel coding for digital speech 
transmission in the Japanese digital cellular system", IEICE Radio Communications 
Systems Research Society, RCS90-27, pp. 41-45, 1990, is a method wherein the 
parameters of past frames are repeated when the current frame is a frame having code 
errors or a method of repeating the parameters and further controlling the gain of the 
current frame depending on the state of past frames. 

The other restoration method, as described in Japanese Patent Application, First 
Publication No. H4-30200, is a method wherein, when there is an error, the current 
erroneous frame is interpolated from past correct frames and future correct frames. In 
this case, an interpolation with little distortion can be obtained if the past correct frames 
and the future correct frames are close in time. 

Problems to be Resolved by the Invention 

However, restoration from only the past ultimately repeats past speech so as to reduce 
the power of frames with errors and make the distortion indiscernible, in which case the 
current error frame parameters are determined from only past information, so that 
although there is no need for a delay of more than the current frame for restoration, the 
continuity with the next frame is lost, thus causing quality degradation anew. 

Additionally, when the interpolation from past and future correct frames according to 
Japanese Patent Application, First Publication No. H4-30200 is applied to a 
transmission path with frequent burst errors, the frame after the current frame is not 
necessarily a correct frame, and there are cases in which past and future correct frames 
are largely separated in time, thus causing large distortions due to the interpolation. 



Japanese Patent Application Page 4 

First Publication No. H6-12095 

Additionally, when errors persist in several frames after the current frame, the required 
delay time must be made that much longer in order to use correct future frame 
information, thus making it unsuitable for applications such as mobile radio which 
emphasize the real-time property. 

Furthermore, in CELP coding, the adaptive codebook and memory of the synthesis filter 
depend on the decoded results of past frames, and even if there are no coding errors in 
the current frame, if there are errors in past frames, distortions will occur in the 
reconstructed speech. The purpose of the present invention is to offer a speech 
decoding method which resolves these problems and has little quality loss even on a 
transmission path with burst errors. 

Means for Resolving the Problems 

In the present invention, when a code error is detected in the current frame using the 
code error detection information and speech coding information respectively of past, 
current and future frames, during reception, by the frame, of an error correction code 
and speech coding information to reconstruct speech, the speech is reconstructed by 
interpolation and restoration of the parameters of the current frame according to code 
error detection information from past and future code error detection information and 
speech coding information, and when code errors are not detected in the current frame, 
if there is a code error detection in any of the past and future frames, the parameters of 
the current frame are corrected according to the error detection information. 

Functions 

Due to the employment of the above structure, not only the code error detection 
information of the past and current frames, but the code error detection information up 
to the future frames are used to decode the parameters of the current frame when 
decoding a parameter group from speech coding information. 

Embodiments 

Herebelow, the CELP format shall be taken as an example of a high-capacity speech 
coding system, and an embodiment applying the invention to the CELP format shall be 
explained. In the present embodiment, a CELP format decoder shall be described, but 
application to analogous formats or to systems containing analogous structures is also 
possible. 

Fig. 2 is a block diagram of a speech decoding process in an embodiment of the present 
invention. In the present embodiment, decoding using information of past, current and 
future frames up to a single frame shall be described. First, codes transmitted through 
the transmission path are error correction processed by the frame in the error correction 
circuit 30. The speech coding information is sent from 30 to the buffer 31, and error 
detection information to the buffer 34. The buffers 31 and 34 take in information from 
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past and current frames and also information from at least one frame in the future. The 
error detection information from the buffer 34 enters the error state management portion 
35, and the error states of past, current and future frames is decided 1 

The speech coding information which is the output of the buffer 31 and error state from 
the error state management portion 35 are sent to the parameter decoding portion 32, 
where the parameters of the current frame are decoded and interpolated. The speech is 
reconstructed in the speech decoderji by means of the output from the parameter . 
decoder 32. In the error state management portion 35, as shown in Fig. 3, the flag 
groups (S2, SI, SO) representing the states of past, current and future frames are updated 
on the basis of the error detection information until the next frame stored in the buffer - 
34 of Fig. 2. S2 corresponds to the past frame, SI to the current frame and SO to the 
future frame. The flags S2 and SI are the contents of flags SI and SO in the previous 
frame during updating of the frames for speech coding which have been shifted. The 
content of the flag SO is determined by the error detection information of the input 
frame from the corresponding buffer 34 in the state flag updating portion 51 and the 
content of the flag SI (this corresponds to the flag SO of the previous frame). 

Fig. 4 shows an update table for SO. In Fig. 4, when the input to the buffer 34 equals 0, 
this indicates that error correction has been completely achieved in that input frame, 
when the input to the buffer 34 equals 1, this indicates that error correction was not 
possible, i.e. that an error was detected in the frame. When SI equals 0, if no errors 
are detected in the input frame, then SO is made equal to 0, while if an error is detected, 
then SO is made equal to 1. When SI is in the range of from 1 to a predetermined 
number M — 1, if no errors are detected in the input frame then SO is made to equal 0, 
while if an error is detected then SO is made equal to SI + 1, and the number of 
consecutive error frames until the corresponding frame is held as the content of the flag. 
When SI is equal to M, if an error is not detected in the input frame, then SO is made 
equal to M + 1, and if an error is detected, then SO is made equal to M Here, M is the 
length of consecutive frames over which burst-type errors can be considered to exist, e.g. 
6. When SI is equal to M + 1, if no errors are detected in the input frame, then SO is 
made equal to 0, and if an error is detected, then SO is made equal to M. Therefore, 
when SO is equal to M + 1, the case is such that no errors are detected in the 
corresponding frame, but since it is after Consecutive burst errors, it is not made equal to 
0, and is only made equal to 0 when no/^rrors are detected in the subsequent frame. 

With regard to shifting of the state flags in this manner, the document by M. J. 
McLaughlin mentioned among the prior art (IEICE Radio Communications Systems 
Research Society, RCS90-27, pp. 41-45, 1990) takes the current frame as an input, SO as 
the flag of the current frame used in conjunction with the past flags, which greatly 
differs from the present invention wherein the current frame is made SI and the future 
frame is made SO, and the flags of the frame subsequent to the frame to be decoded are 
used. 



Next, using the outputs S (S2, SI, SO) of the error management control portion 35 
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obtained in this way, decoding of the parameters of the speech coding information is 
performed in consideration of the error states of the past, current and future frames in 
the parameter decoding portion 32 of Fig. 2. Fig. 5 shows a flow (Jiagram of a process 
depending on the error state flags in the parameter decoding portion 32. First, in step 
101 of Fig. 5, it is determined whether the error state flag SI of the current frame is 0, 
and if 0, then process 1 of step 111 is performed. In process 1, since SI = 0, SO and S2 
are observed on the basis of parameters decoded based on the speech coding 
information of the current frame, and if SO or S2 is not 0, then corrections are made so 
as to make the parameters connect continuously. 

Next, in step 102, when SI is not equal to 0 and the flag is anywhere from 1 to M, this 
indicates that an error has been detected/in the current frame. Therefore, in this case, 
errors are contained in the speech coding information of the current frame. Thus, the 
information of a past frame and a future frame must be used. In step 103, if SO = 0 or 
M + 1, then the future frame is correct, so that as the process 2 of step 112, the 
parameters of the current frame are interpolated from the parameters used for decoding 
in the past frame and the decoding parameters of the future frame. At this time, the 
weighting of the interpolation can be changed according to the degree of continuity of 
errors in the flag of SI. For example, if SI is close to M, then the parameters of the 
past frame will be parameters obtained by an extrapolation process from process 3 of 
step 113 to be described, with burst errors continuing, so that by making the weighting 
of past parameters small and the weighting of future frames large, it is possible to obtain 
parameters of the current frame with less distortion. In step 103, if SO does not equal 0 
and SO does not equal M + 1, then an error has been detected in a future frame. In this 
case, an extrapolation of parameters used in decoding past frames to the current frame is 
performed in process 3 of step 113. At this time, depending on the degree of 
continuity of errors in the SI flag, it is possible to change the weighting of the 
extrapolation results. 

In step 102, if the result is NO, then SI = M + 1, and this indicates that the current 
frame is correct after at least the last M frames have had burst errors. Accordingly, in 
process 4 of step 114, the parameters decoded based on the speech coding information 
of the current frame are taken as the base, and since S2 = M, there have been burst 
errors in the past frame, and SO is equal to 0 or M, so that the parameters are corrected 
so as to connect continuously in accordance with the value thereof. 

Next, the examples of processes 1-4 of Fig. 5 shall be described in connection with 
actual parameters. In CELP coding, the speech coding information of each frame is 
composed of a linear prediction parameter A, a power parameter code P, a pitch period 
code L and a noise code vector C of a noise codebook, and a gain parameter G of a 
sound source as shown in Fig. 1. First, an example with the linear prediction 
parameter code A shall be given. As a linear prediction parameter, a line spectrum pair 
(LSP) with good linear interpolation properties is used during interpolation. If one 
frame is taken to be 20 ms, the linear prediction parameters will be such as to actually 
require a parameter every subframe unit of about 5-10 ms. In that case, the parameters 
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of the subframes are determined by interpolation from the parameters of the previous 
frame (past frame) or subsequent frame (future frame). Then, if there are code errors 
in the previous (or subsequent) frame, a direct interpolation of erroneous parameters 
will yield a large distortion. If the parameters of the current frame are erroneous, it is 
possible to interpolate the current frame from the previous and subsequent frames as 
long as there are no errors in the previous and subsequent parameters. 

The LSP parameters which are not interpolated immediately after error correction in the 
past (time n - 1), current (time n) and future (time n + 1) frames are respectively taken 
as w(n - 1), w(n) and ui(n + 1), and the parameters corresponding to the times n - 1 and 
n after interpolation in consideration of the error state flags are represented as Q(n - 1) 
and Q(n). Additionally, with respect to a t in the range of a <; t <; b, the interpolation 
parameter Q(r)at the time t when the parameters are Xa at time a and Xb at time b shall 
be defined as follows: 



G(0 = LA(<; (a, Xa), (b, Xb)) 

The function f_Ais a function of interpolating between two points (a, Xa), (b, Xb), for 
outputting an interpolation value for time /. For example, it may represent a linear 
interpolation. 

In process 1 of step 111 in Fig. 5, the error state flag SI of the current frame equals 0, 
and there are no errors in co(/i), therefore the following should be done in accordance 
with the error states of S2 and SO. With regard to t of the current frame, when n - 1 <, t 
<; n, if S2 = 0, then Q(r) = f_A(t; (n - 1, Q(/i - 1)), (/?, «(«))); if S2 = 1, ... , M, then Q(t) 
= f _A('; (n - 1, w(n)), (/i, o</j))). When n <s t <. n + 1, if SO = 0, then Q(t) = f_A(f; (n, 
w(n)), (n + l,w(n + 1))); if SO = 0, 1, ... , M, then Q(r) = f_A(f; («, cu(«)), (n + 1, w 
(«)))• 

In process 2 of step 112 in Fig. 5, w(n) is not used because there is an error in the 
current frame parameter, and since SO = 0 or M + 1, the parameters of the time t in the 
current frame are interpolated according to Q(/) = £_A(t; (n - 1, Q(n - 1)), (n + 1, oo(/j + 
1))) using the parameters Q(n - 1) and co (n + 1). In the process 3 of step 113, SO = 1, 
... , M, so that using the past parameters, the parameters of the time t in the current 
frame are interpolated according to Q(t) = f_A(t; (n - 1, Q(n - 1)), (n + 1, Q(n - 1))). 
In this case, a function which expands the bandwidth of the spectrum of Q(t) depending 
on the values of S2, SI and SO can be used to weaken the resonance properties of the 
LSP parameters in the output, thus having an interpolation which prevents distortion. 

In process 4 of step 114 in Fig. 5, oo(«) can be used, but the previous frame is a burst 
error, either Q(n - 1) or a parameter such as to obtain a flat spectrum as Q0 is used. 
This is taken asQ'. At fin the current frame, when n - 1 <; t ^ n, Q(t) = f_A(/; (n - 1, 
Q'), («, ct)(/i))); and when n <; / =s n + 1, if SO = 0, then Q(0 = f_A(*; (*, w(n)% {n + 1, a> 
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(n + 1))), and if SO = M, then Q(t) = £_A(t; (n, w(n)), (n + 1, u> («))). 

Next, the processing of the power parameter codes in Fig. 5 shall bej considered. This 
is basically the same as interpolation of linear predication parameters. In the case of 
power also, the case wherein power is calculated in units of subframes shall be 
described. The power parameters which are not interpolated immediately after error 
correction in the past (time n - 1), present (time n) and future (time n + 1) are 
respectively taken as p(n - 1), p(n) and p(n + 1), and the parameters corresponding to 
the times n - 1 and n alter interpolation in consideration of the error state flags are taken 
as P(n - 1) and P(n). 

The interpolation parameter P(/) at time t of the power which is Xa at time a and Xb at 
time b is defined as follows: 



P(0 = LPC; X«) ( (6, Xb)) 

The function f_P represents a function such as linear interpolation between the two 
points (a, Xa) and (£>, Xb), or linear interpolation of Xa, Xb expressed logarithmically. 

In process 1 of step 111 in Fig. 5, SI = 0, so there are no errors in p(n). Therefore, at t 
in the current frame, when n - 1 <; t <; n, if S2 = 0, then P(t) = f_P(t; (n - 1, P(« - 1)), («, 
p(n))); and if S2 = 1, ... , M, then P(0 = f_P(f; (n - 1, p(n)), («, p{n))). If the resulting 
P(r) is at least a standard constant (e.g. a & 1) times greater than P(n -1 ), then it is made 
a'P(r) (0 2; a' ^ 1) to limit the power, thus making it impossible to discern distortion 
due to sudden power changes. Additionally, when n <s t =s n + 1, if S0=0, then P(t) = 
f_P(r; p{n)\ {n + 1, p(n + 1))), and if SO = 1, M, then P(t) = fP(f; («, (« + 1, 

PW))- 

In the process 2 of step 112 in Fig. 5, there is an error in the current frame parameters, 
so that p(n) is not used, and since SO = 0 or M + 1, P(n - 1) and p(n + 1) are used to 
interpolate the parameters of the time t in the current frame according to P(/) = f_P(/; (n 
- 1, P(n - 1)), (n + 1, p(n + 1))). Here, if the power change is too large such that with 
respect to a constant threshold value (e.g. a (^ 1)), P(t) > aP(n - 1), then the power is 
substituted by a'P(t) (0 «s a' <; 1) to limit the power, thus making it impossible to 
discern distortion, a' may be changed according to the continuity number of burst 
errors of SI. 



In process 3 of step 113, SO = 1, M, so that using the past parameters, P(t) - f_P(f; 
(n - 1, P(n - 1)), (n + 1, P(n - 1))), and tjie volume of the error frame is weakened to 
replace P(/) with a?(/) (0 =s a' 1). a' may be changed according to the continuity 
number of burst errors of SI. For example, if SI = M, this means that large errors 
have occurred in at least M frames until the current frame, so that a' is made to equal 0 
to form a soundless interval, to keep from outputting distorted speech. 
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In process 4 of step 114 in Fig. 5, p(n) is usable, but the previous frame is a burst error 
frame. Therefore, at t in the current frame, when n — 1 «s t ^ /z, thus P(r) = t^P(*; (n - 1, 
P(n - 1)), (n, p(n))). Then, P(t) is replaced with a'P(r) (0 <; a' <; 1> When nztxn + 
1, if SO = 0, then P(t) = t_P(t; (/*, p(ri)\ (n + l,p(n + 1))); if SO = M, then P(f) = f_P(r; 
{n, p(n)% (n + 1, p(n))). When SO = M, P(t) is made equal to a'P(/) (O^a'i 1). 

Next, an example of parameter decoding of a pitch period code L shall be described. 
The pitch period code L is a delay value indicating the pitch delay of the adaptive 
codebook 10 in Fig. 1. Here, one frame is divided into k (e.g. 2) subframes, and that 
number of the value of L is transmitted as pitch information. The pitch delay is not 
only the pitch period in CELP, but can also have a jumping value of double periods or 
triple periods. Therefore, in the present embodiment, a case wherein k - 2 will be 
explained in connection with a method wherein the value of the pitch period between 
the previous frame and subsequent frame is substituted when erroneous in the current 
frame. Of course, when the variations in the pitch period are small, and the pitch 
period of the previous and subsequent frames can be interpolated, it can be determined 
by interpolation. 

The pitch delays immediately after error correction in a past (time n — 1), current (time 
n) and future frame (time n + 1) frame pre respectively taken as lk{n — 1), lk(n) 7 lk(n + 1), 
{k = 1, 2), and the parameters corresponding to the times n — 1 and n after considering 
error state flags are made Lk(n — 1), Lk(n), (k = 1, 2). In process 1 of step 111 in Fig. 5, 
SI = 0, so that there are no errors in lk(n) y and Lk(n) = lk(n\ (k = 1, 2). 

In process 2 of step 112 in Fig. 5, there is an error in the parameters of the current frame, 
so that lk(ji) is not used, and L1(h) - L2(n -1 ), L2(/i) = ll(n + 1). In proces 3 of step 
113, SO = 1, M, so using past parameters, Lk(n) = L2(n - 1), (k = 1, 2). 

In process 4 of step 114, the current frame is correct, so that as in process 1, Lk(n) = 
lk{n), (k=\, 2). Next, a decoding example for the sound source gain parameter G 
shall be described. In Fig. 1, G is a general term of the group of gains gl, g2 
multiplied to the respective sound source signals selected in the adaptive codebook 10 
and noise codebook 11. With regard to this Q parameter decoding is performed in a 
manner similar to the pitch delay. The sound source gains immediately after error 
correction in the past (time n - 1), current (time ri) and future (time n + 1) frames are 
respectively taken as ak(n — 1), ok(n), ok(n + 1), (k = 1, 2), and the parameters 
corresponding to the times n - 1 and n after considering the error state flags are taken as 
Gk(n - 1), Gk(n), (k=l, 2). 

In process 1 of step 111 in Fig. 5, SI = 0 and there is no error in ok(n) y so Gk(n) = ok(n), 
(k = 1, 2). In process 2 of step 112 in Fig. 5, there are errors in the current frame 
parameters, so ok(n) is not used, and Gl(n) = G2(n - 1), G2(n) = al(n + 1). 

In process 3 of step 113, SO = 1, ... , M, so using the past parameters, Gk(n) = G2(n - 1), 
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(k = 1, 2). In process 4 of step 114, the current frame is correct, so as in process 1, 
Gk(n) = ok(n), (£=1,2). 

! 

Concerning the decoding of the noise vector code C of the noise codebook, there is not 
much correlation between frames, so that in the present embodiment, the codes sent to 
the current frame are directly used for reproduction without regard to the error state 
flags. In this way, in the parameter decoding portion 32 of Fig. 2, the parameters of 
the speech coding information are decoded and restored based on the past, current and 
future error state flags. The result is turned over to the speech decoder 33 to recreate 
the speech. 

The distortion due to transmission errors can be reduced in the speech decoder 33 as 
well by using past, current and future frame error state flags S (= S2, SI, SO) from the 
error state managing portion 35 to control the reconstructed speech of the decoded 
speech, thereby reducing distortion due to transmission errors. That is, in the 
post-filter 13 of Fig. 1, the coefficient of the post-filter is controlled by using S to 
suppress auditory distortions. Normally, the post-filter is composed of H(z) as follows. 

H( Z ) = (1- n Z~ l )A(Z/ r 1 ) 

((1- SB(z))A(z/ 7 2)) 

n 

A(z) = Z a iz~ l 
N 

B(z) = 2 $ l2~ T - 1 
M 

Here, cu is the linear prediction coefficient for each frame, pi is the long-term prediction 
coefficient and x is the pitch delay. These are determined adaptively from the 
transmission parameters or decoded speech. Formant emphasis is performed with 
A(z/y1)/A(z/y2), pitch emphasis is performed with 1/(1 - 5B(z)), and a spectral slant 
correction effect is provided by 1 - v\z l . In the present embodiment, H(z) is adaptively 
varied in each frame with r\ 9 6, yl and yl as functions of the error state flags S(= S2, SI, 
SO). When S2, SI and SO are all either 0 or M + 1, normal values are used, but when 
any of S2, SI or SO is not 0 or M + 1 and there is an error frame, there is a high 
probability of errors being contained in the adaptive codebook and noise code vector, 
where the noisiness of the output speech can be suppressed by reducing r\ and 6, 
weakening the pitch emphasis of the sound source and high-region spectral slant, and 
increasing the degree of formant emphasis by adjusting yl and y2. Here, by using 
error state flags up to the future frame of SO, it is possible to continuously perform 
switching of r|, 5, yl and y2 better than with only S2 and SI, thus enabling the 
suppression of degradations due to sound quality changes caused by switching between 
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frames. 

Effects of the Invention i 

As described above, according to the present invention, it is possible to suppress 
unpleasant sonic distortions which are superimposed on the decoded speech when code 
errors from the transmission path remain on the speech coding bits. 

BRIEF DESCRIPTION OF THE DRAWINGS 



Fig. 1 A block diagram of a CELP speech decoder. 

Fig. 2 A block diagram of a speech decoding process according to an embodiment of 
the present invention. 

Fig. 3 An enlarged view of the error state managing portion 5 of Fig. 2. 

Fig. 4 An updating table for the SO flag in the state flag updating portion 51 of Fig. 3. 

Fig. 5 A schematic diagram of the processing flow depending on the error state flags. 
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